Text Mining Grid Services for Multiple Environments
Abstract
This paper describes the implementation of text mining grid services for the Aîuri Project, a framework that combines a user-friendly interface, data and text mining tasks, database access and a visualization tool, all integrated with various grid environments. The focus is on developing and testing components for the analysis and evaluation of unstructured data in distinct grid environments. These components are grid services for text mining processes that use different execution approaches, depending on the grid environment to which the user chooses to submit a job. All components are open source and freely available to the scientific community, which gives access to the existing services and encourages the addition of new ones.
Similar resources
WS-DAI-DM: An Interface Specification for Data Mining in Grid Environments
Providing appropriate means of access to data mining services in a Grid environment is essential to combining Grid computing and data mining. The transition from the centralized data mining processes of traditional tools to Grid-compliant, Grid-based data mining services that can coordinate with each other is important for extracting useful potential knowledge and patterns from distributed dat...
Multi-agent Web Text Mining on the Grid for Enterprise Decision Support
In this study, a multi-agent web text mining system on the grid is developed to support enterprise decision-making. First, an individual intelligent learning agent that learns about underlying text documents is presented to discover the useful knowledge for enterprise decision. In order to scale the individual intelligent agent with the large number of text documents on the web, we then provide...
Grid-enabled Support for Classification and Clustering of Textual Documents
This paper presents the fusion of two approaches – Grid and Grid computing and text mining. GridMiner is a system developed at University of Vienna, and it is a framework for knowledge discovery process in the distributed Grid environment. JBOWL is a framework for text mining and information retrieval being developed at Technical University in Košice. Text mining provides some methods (includin...
Grid-based Distributed Data Mining Systems, Algorithms and Services
Distribution of data and computation allows for solving larger problems and execute applications that are distributed in nature. The Grid is a distributed computing infrastructure that enables coordinated resource sharing within dynamic organizations consisting of individuals, institutions, and resources. The Grid extends the distributed and parallel computing paradigms allowing resource negoti...
Designing data analysis services in the Knowledge Grid
Grid environments were originally designed for dealing with problems involving compute-intensive applications. Today, however, grids enlarged their horizon as they are going to manage large amounts of data and run business applications supporting consumers and end users. To face these new challenges, grids must support adaptive data management and data analysis applications by offering resource...
Publication date: 2008